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Abstract 

Xhis nationwide study examined the relationships 
among gender, ethnicity, and poverty with fifth 
graders’ {n = 8,741) science performance. Extant 
fifth grade data files (2003-2004), from fhe Early 
Childhood Longifudinal Sfudy, Kindergarten Class 
of 1998-99 (ECLS-K), were used. An ANOVA 
test revealed that males performed significantly 
better than females on science assessments, and this 
difference was maintained across ethnic groups. Xhe 
science scores, in order of ethnicity from the highest 
to lowest scores, were White, Asian, Hispanic, and 
African American. Asian fifth graders showed the 
largest ethnic discrepancy between students above 
and below poverty. Xhe unique features of this study 
were the depth of the disaggregation of the data and 
statistical analyses. Disaggregation of data by all three 


variables revealed compounding consequences for 
students at the extremes. White “at/above poverty” 
males had the highest mean science IRX scale score, 
while African American “below poverty” females had 
the lowest mean score. 

Introduction 

Xhe demographics of the U.S. public school system 
are constantly evolving. Currently, the student 
enrollment is nearly 50 million, with ethnic minority 
students composing 43% of the population. In 2006, 
17% of school-age children were living below the 
poverty level. Xen percent of White students were 
below the poverty level, while 33% of African 
American students and 26% of Hispanic students 
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lived below the poverty level. Twenty percent of 
school-age children spoke a language other than 
English at home, and almost three-quarters of those 
students spoke Spanish (Planty et al., 2008). 

The No Child Left Behind Act (NCLB) of 2001 
advocated that all children can learn and that every 
school must teach (U.S. Department of Education, 
2002). Prior to the early 1990s, Stevens (1993) noted 
that the “opportunity to learn science cannot be 
analyzed because science is not tested,” and “we teach 
what is tested” (p. 33). Science curriculum was reduced 
during the first six years of NCLB to focus on reading 
and math (Cavanaugh, 2007), but in 2007 national 
science testing was finally required. 

In an increasingly technological world, it is imperative 
to develop early the minds of scientists, inventors, 
and engineers who will keep pace with a highly 
competitive international market. President Obama 
stated, “In a global economy ... a good education 
is no longer just a pathway to opportunity — it is a 
prerequisite. The countries that out-teach us today 
will out-compete us tomorrow” (U.S. Department 
of Education, 2009). Researchers need to track U.S. 
students’ science performance and determine where 
extra efforts are needed to improve science education 
for all students. The interactions of gender, ethnicity 
and socioeconomics have been considered essential 
components related to science performance research 
(Baker, 2002; Kahle & Meece, 1994; Krockover 
& Shepardson, 1995; Lynch, 2000; Rodriguez, 

1998; Stevens, 1993); however, they are absent in 
the National Science Standards (National Research 
Council, 1996), ignoring their importance and 
ramifications to science education (Rodriguez, 1997). 
The disaggregation of data using these variables 
was not required by the NCLB until 2002-2003 
(Education Trust, 2003). 

Theoretical Framework 

Gender 

Gender has been defined as “. . . a social construction, 
usually based upon the biology of one’s body” 
(Scantlebury & Baker, 2007, p. 258). Both genders 
inherited socio-cultural expectations and treatments 
that should not have had anything to do with their 
sexual designation (Haslanger, 2000). However, 
most cultures led females and males into different 
experiences so that males and females started 
school with different knowledge, expectations, and 
self-confidence in learning (Scantlebury, 1994). 
Researchers confirmed the presence of gender 


stereotypes in science learning. Gender stereotypes 
started as early as first grade science. Lemales had 
less confidence in their ability to learn science and 
had higher science test anxiety than males. Even 
though there were no gender differences with respect 
to how much students liked physical science, parents 
and male students perceived that males were and 
should be more competent in physical science. These 
views persisted, despite instances in which females 
outperformed males in science (Andre, Whigham, 
Hendrickson, & Chambers, 1999; Greenfield, 1996; 
Kanai & Norman, 1997). 

Sfudies conducted through large data sets showed 
congruent results. The Nation’s Report Card: Science 
2005 reported that males continued to score higher 
than females in science, despite improved science 
scores by both genders (Grigg, Lauko, & Brockway, 
2006). Maehr and Steinkamp (1983) investigated 
gender differences in science learning through meta- 
analysis. They reviewed articles from 1965 through 
1981, which included more than 14 million students 
from 20 countries. Small but definitive gender 
differences were found that favored males in science 
performance. Campbell, Hombo, and Mazzeo (2000) 
examined the NAEP data from 1969 to 1999 and 
found that males in the elementary and middle schools 
outperformed females on science achievement tests. 

However, a study conducted by Greenfield (1996) 
indicated this trend was disrupted when females 
enrolled in more or advanced science classes. In a 
study of high-achieving African American females 
in middle and high schools, Pollard (1993) found 
that they possessed characteristics congruent with 
successful White students: confidence in abilifies, 
support structures, and enhanced problem solving. 

Ethnicity 

Tracking students’ performance by ethnicities showed 
differences. In the early 1980s, Rakow (1985) found 
that Whites outscored other ethnicities at ages 9, 

13, and 17. In a rare instance in 1990, Asians had 
the highest science proficiency in the twelfth grade, 
but not in grades four or eight (National Science 
Eoundation, 1994). In 2000, despite a five percent 
increase in the population of students below poverty, 
fourth grade Hispanic and African American students 
narrowed the persistent gap with White students 
(Grigg et al., 2006). Between 1996 and 2005, fourth 
graders science performance scores, in order of 
highest to lowest, were White, Asian/Pacific Islander, 
American Indian, Hispanic, and African American 
(U.S. Department of Education, 2007). 
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Peng and Hill (1995) explained that children started 
school with similar attitudes about science, hut 
ethnic minorities grew less and less prepared for 
science as they proceeded through the upper grades. 
Lower science performance among ethnic minorities 
was considered a product of inadequate learning 
opportunities in the areas of curriculum, instruction, 
assessment, teacher education, school organization, 
educational policies and a failure to connect with 
students’ homes and community environments 
(Lee & Luykx, 2005, 2007). There was also an 
overrepresentation of ethnic minorities in the low 
track. Students in low track classes were rarely able to 
escape the hierarchical labels, because they became 
enmeshed in a subculture of poor performance and 
failure. In addition, the school staff participated as 
gatekeepers who were unlikely to change the status 
quo (Gilbert & Yerrick, 2001). Gutierrez and Rogoff 
(2003) warned that ethnic differences should never 
be used to rationalize limiting individuals from any 
ethnic/socio-cultural group. Recognizing both the 
importance and limitations of students’ cultures, 
Solano-Flores and Trumbull (2003) stated. 

Culture -free tests cannot be constructed because 
tests are inevitably cultural devices. Therefore, 
understandings of non-mainstream language and 
non-mainstream culture must be incorporated 
as part of the reasoning that guides the entire 
assessment process, (p. 9) 

Gaps in science achievement can be reduced by making 
science more relevant to the socio-cultural lives of 
students (Seiler, 2001). Lee and Fradd (1998) promoted 
the idea of instructional congruence, a process to 
reconcile academic content with students’ cultures and 
languages. By incorporating real world problems and 
interdisciphnary content in conjunction with community 
partnerships, a “connected science” could be created 
(Bouillion & Gomez, 2001). Parsons, Foster, Gomillion, 
and Simpson (2008) made this succinct summation of the 
impediments facing many ethnic minority students: 

Science is a specific way of knowing and 
doing, which requires a prescribed way of 
communicating; therefore, students who do not 
communicate in a manner commensurate to 
what is expected will face notable challenges in 
acquiring the competencies needed to succeed 
in science. Students from diverse cultural 
backgrounds exhibiting communication patterns 
contrary to those modeled in the science 
classroom will have difficulty learning new 
concepts, connecting new learning to their prior 


knowledge, and expressing their understanding to 
others, (p. 81) 

Poverty 

Between 1996 and 2005, significant differences in 
science achievement were found between students 
above and below the poverty level (Grigg et ah, 

2006; O’Sullivan, Lauko, Grigg, Qian, & Zhang, 
2003). The impact of ethnicity and poverty on 
science performance has been linked as early as third 
grade (Kohlhaas, Lin, & Chu, 2010). Middle school 
students in the Third International Mathematics and 
Science Study (TIMSS) were found to have a strong 
correlation between science performance and poverty 
across 45 countries (Yang, 2003). Poverty levels 
correlated with unequal opportunities related to the 
quality of instruction, school and home resources, 
and students’ language proficiencies (Hewson, Kahle, 
Scanflebury, & Davies, 2001; Kahlenberg, 1995a, 
1995b). Consequenfly, fhis lack of opporfunifies 
progressed fo more unmofivafed sfudenfs and lower 
feacher morale (Lynch, 2000). 

Accountability Systems 

Tobin, Rofh, and Zimmermann (2001) believed fhaf 
“nafional sfandards are a componenf of hegemony, 
which mainfains achievemenf gaps befween Whiles 
and African Americans” (p. 961). Allhough Ihe 
original inlenl of slandards-based reform (SBR) was lo 
promole equily in educalion, many of Ihe inlroduced 
policies have empowered slales and disempowered 
communilies. Schools alone may nol be capable of 
overcoming all Ihe socioeconomic inequalilies of our 
sociely (McDermoll, 2007). Accounlabilily systems 
widened Ihe gap befween high- and low-performing 
sfudenfs, because lime and money was spenl “leaching 
lo Ihe lesl.” Inilially, fhis increased scores yel deprived 
sfudenfs of an in-deplh curriculum, crealing a new 
form of discriminalion againsl high-poverly sfudenfs 
and schools (McNeil, 2000). 

Ironically, schooled knowledges and disciplines 
may, while offering cerlain freedoms and 
opporfunifies, al Ihe same lime furlher draw 
sfudenfs info dominanl projecls of nalionalism 
and capilalisl labor formalion or bind Ihem even 
more lighlly lo systems of class, gender, and race 
inequality. (Levinson & Holland, 1996, p. 1) 

Science reform needs lo go beyond Ihe key concepls 
and conformisl science praclices espoused by 
Ihe American Associalion for Ihe Advancemenl 
of Science (AAAS), Nafional Science Teachers 
Associalion (NSTA), and Nafional Research 
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Council (NRC) and develop alternative pathways 
for diverse students that are more democratic and 
inclusive (Eisenhart, Finkel, & Marion, 1996). Rivet 
and Krajcik (2008) suggested that project-based 
instruction be contextualized by using students’ prior 
knowledge and personal experiences. Educational 
systems must incorporate inclusive curricula that 
adhere to the “science for all Americans” philosophy, 
a science curriculum that allows “collateral learning” 
by using culturally sensitive instruction that crosses 
cultural borders (Aikenhead & Jegede, 1999). 

Disaggregated Data 

Between 1990 and 2004, Scantlebury and Baker 
(2007) found there were only 46 research articles 
that used both the terms “gender” and “race” with 
“science.” Kahle (2004) also noted a lack of research 
that connected gender, ethnicity, and science. Others 
recommended that future research in science explore 
the interactions of gender, SES, and ethnicity 
(Baker, 2002; Kahle & Meece, 1994; Krockover & 
Shepardson, 1995; Lynch, 2000; Rodriguez, 1998; 
Stevens, 1993). “We must view, race, ethnicity, class, 
and sociocultural identities in relation to gender” and 
“understand the role of family and community as 
socializing agents” (Krockover & Shepardson, 1995, 
p. 223). 

The lack of results by groups and subgroups 
reinforced stereotypes. “A great deal of information 
is revealed when data are reported by gender within 
ethnic groups” (Rodriguez, 1998, p. 214). Lee and 
Luykx (2006) recommended using disaggregated data. 
Disaggregated data has captured “the interaction of 
gender and racial/ethnic differences by addressing the 
issue of whether gender differences vary within racial/ 
ethnic groups to understand differences in educational 
achievement and opportunity across racial/ethnic 
groups” (Coley, 2001, p. 2). With subgroup-specific 
information, stakeholders can better identify 
actions needed to close the gaps (Lynch, 2000). To 
capture these subgroups, NCLB began requiring 
disaggregated data during the 2002-2003 school year 
(Education Trust, 2003). 

Purpose 

Literature showed the impacts of the individual factors 
of gender, ethnicity, and poverty on students’ science 
achievement but failed to examine the interaction 
of the three variables together. The purpose of this 
study was to expand research by simultaneously 
examining gender, ethnicity, and poverty level with 
fifth graders’ science performance. Disaggregating the 


three variables provides a more coherent expression 
of subgroups’ characteristics on science outcomes that 
are unachievable when examining a single variable. 

Method 

Data File 

To have meaningful results, the data set required 
adequate sizes in each of the subgroups. Thus, the 
Early Childhood Longitudinal Study, Kindergarten 
Class of 1998-99 (ECLS-K) was selected. The 
ECLS-K was initiated by the U.S. Department of 
Education, Institute of Education Sciences (lES) in 
response to a congressional mandate requiring an 
account of the status of education in the United States. 
The National Center for Education Statistics (NCES) 
was entrusted with the data collection, processing, 
distribution, and reports for this longitudinal project. 
Approximately 23,000 kindergarten children and 
parents participated during the first year (1998-1999) 
of the project. The fifth grade data were collected 
during the 2003-2004 school year and were released 
to the public in 2006 (Tourangeau, Nord, Le, Pollack, 
& Atkins-Burnett, 2006). 

Description of Samples 

The fifth grade ECLS-K data included students with 
learning disabilities, which were not of interest in 
this study and could have skewed the estimates. After 
removing this group of students, there were 8,741 fifth 
grade students. Among them, 48% were males and 52% 
females. The ethnic distribution was 58% White, 19% 
Hispanic, 11% African American, 7% Asian, and 6% 
Other. Eighteen percent reported living below the poverty 
line, while 82% were at or above the poverty line. 

Weights 

The ECLS-K used a multistage probability sample 
design to select a nationally representative sample 
of children attending kindergarten in 1998-99. To 
maintain sufficient sample size for analysis, small 
minority groups such as Asian, Alaskan, and Pacific 
Islanders were oversampled (Tourangeau el al., 2006). 
To resolve Ihe oversampling issue, sampling weighls 
were used lo balance and mainlain subpopulalion 
represenlaliveness. NCES considered Ihe dissimilar 
survey response rales across subpopulalions when 
calculaling weighls. Differenl analyses were required 
for each weighted variable. When determining child- 
level characlerislics or assessmenl performance, Ihe 
NCES advised using Ihe weighl variable C6CW0 
(Tourangeau el al.). For slalislical analyses, SPSS 
Professional version 16.0 was used, because if had 
Ihe funclion of adding weighls during analysis. The 
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ECLS-K weight variable was applied during ANOVA 
analysis using the recommended weight option. 

Gender. The ECLS-K students’ gender data were not 
collected at fifth grade but were verified fhrough a 
composife of fhe parenfs’ and feachers’ surveys given 
in kindergarfen, firsf, and fhird grades. The genders 
were coded dichofomously info “1” for males and “2” 
for females. 

Ethnicity. Sfudenfs’ efhnic dafa were nol collected 
af fiflh grade buf were verified fhrough a composife 
of fhe parenfs’ and feachers’ surveys given in 
kindergarfen, firsf and fhird grades. There were a fofal 
of seven ethnic groups defined by ECLS-K. In fhis 
sfudy, fhe efhnicifies were redefined info five groups 
fo obfain larger sample sizes in fhe minorify groups. 
The groups were: “1” While, “2” African American, 
“3” Hispanic, “4” Asian, and “5” olher. 

Poverty. The slandards used for delermining poverly 
levels in ECLS-K were consisfenf wifh fhose measures 
sef by fhe U.S. Census Bureau (2003). A combinafion 
of numerical condifions defermined fiflh graders’ 
poverly rankings. The household’s weighled average 
income, the size of the family unit, and the number 
of related children under 18 years of age were used to 
estimate students’ poverty levels. The poverty level 
variable was split into two classes, “below poverty” 
and “at/above poverty” (Tourangeau et ah, 2006). 

Measure of Dependent Variable 
The dependent variable for this study was fifth graders’ 
science performance. The ECLS-K measured science 
performance through 92 science items that focused on 
conceptual understanding and scientific invesligalion 
frameworks. The conlenl aligned wifh fhe 1996 NAEP 
frameworks and included earlh, physical, and life 
sciences. The ECLS-K reported sfudenfs’ science 
scores in raw score, 1-score, and Item Response Theory 
(IRT) scale score. This research is one componenl 
of a longiludinal research projecl exploring fhe 
achievemenl levels of science sfudenfs’ as Ihey proceed 
from elemenlary fhrough middle school. The science 
IRT scores (C6RISSCL) permil fhe compulalion of 
crilical eslimales of possible gains or losses belween 
grade levels. The variable of C6R1SSCL was used for 
fhe sfudenfs’ science scores. The range of achievable 
values for fhe fiflh grade science IRT scores was 
belween 0 and 92 (Tourangeau el al., 2006). 

Statistical Analyses 

A full 3 -way analysis of variance (ANOVA) model 
was designed lo lesl each of fhe Ihree main effecls, 
Ihree 2-way inleraclion effecls, and one 3 -way 


inleraclion effecl. The Ihree main effecls were 
gender, elhnicily, and poverly. The 2-way inleraclion 
lerms were gender-elhnicily, gender-poverly, and 
elhnicily-poverly. The 3 -way inleraclion was gender- 
elhnicily-poverly. Wifh Iwo subgroups for gender, 
five subgroups for elhnicily, and Iwo subgroups 
for poverly, fhe 3 -way ANOVA had a fofal of 12 
breakdown groups. The ANOVA model eslimaled 
each parameter by fixing olhers as conslanls; Ihus, 
eslimales of individual paramelers were nol under 
the influences of others in the model. In other words, 
true effects were obtained. If conducting this analysis 
through t-test. E-test, or two-way ANOVA, the effects 
would be overestimated, because effects from the 
second or third factors were not removed. Thus, with 
three factors, a 3-way ANOVA approach surpassed 
methods that could not examine all three factors 
simultaneously. 

Results 

The sample size, mean, and standard deviation 
of fifth grade science IRT scale scores before and 
after applying weights are displayed in Table 1. 

Sample sizes distributed more closely between 
genders, but ethnicity and poverty levels showed 
large discrepancies. Among the different ethnicities. 
White was 58% of the whole sample, Hispanic was 
19%, African American was 11%, and the other two 
groups were each under 10%. Between the poverty 
groups, the below poverty was 18% of the whole 
sample, white above poverty was 82%. Regardless, the 
smallest subgroup N count was 491, a large number 
for ANOVA. 

The average weighted fifth grade Science IRT scale 
score was 58, with a standard deviation of 14.39 and 
a range from 17 to 88. Eifth grade mates had a mean 
score of 59 {SD = 14.33), which was three points 
higher than females’ mean score of 56 {SD = 14.26). 
Students of at/above poverty had a mean score of 61 
{SD = 12.77), white students of below poverty had a 
mean score of 47 {SD = 14.79), a 14-point gap. White 
{M = 63, SD = 11.42) students performed the best, 
followed by Asian {M = 59, SD = 14.94), other 
{M = 53, SD = 15.54), Hispanic {M = 51, SD = 14.17), 
and African American {M = Al, SD = 13.81). The 
largest variance among ethnicities was 16 points 
between White and African American. 

Small or uneven sample sizes could have decreased 
the power of analysis; therefore, it was important to 
check the power of each statistically significant test. 
On the other hand, sometimes strong power might 
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Table 1 

Descriptive Statistics of Fifth Grade Science IRT Scores by Gender, Ethnicity, and Poverty 


Not Weighted Weighted 


Variable 

N 

M 

SD 

Min. 

Max. 

M 

SD 

Min. 

Max. 

All fifth-grade students 

8,741 

59 

13.87 

17 

88 

58 

14.39 

17 

88 

Gender 

Male 

4,193 

61 

13.53 

17 

87 

59 

14.33 

17 

87 

Female 

4,548 

57 

13.97 

17 

88 

56 

14.26 

17 

88 

Ethnicity 

White 

5,056 

64 

10.93 

17 

88 

63 

11.42 

17 

88 

African American 

928 

48 

13.86 

17 

86 

47 

13.81 

17 

86 

Hispanic 

1,631 

52 

14.17 

18 

87 

51 

14.17 

18 

87 

Asian 

623 

59 

14.40 

20 

86 

59 

14.94 

20 

86 

Other 

491 

54 

14.63 

17 

86 

53 

15.54 

17 

86 

Poverty 

Below poverty 

1,554 

48 

14.23 

17 

83 

47 

14.79 

17 

83 

At /above poverty 

7,187 

62 

12.50 

17 

88 

61 

12.77 

17 

88 


result in significant outcomes even when the effect is 
small. Thus, it is also imperative to evaluate effect size 
for its practical importance. The degree of freedom, 

F value, observed power, and effect size are shown in 
Table 2. As expected, the large sample size resulted 
in high powers (1.00). The F values were significant 
at the 0.001 level. This meant that independently and 
interactively the three variables — gender, ethnicity, 
and poverty — had statistically significant effects on 


students’ science performance. Investigations of effect 
size, partial eta squared, showed a range of 0.00 to 
0.116. Studies suggested that partial eta squared values 
below 0.01 is considered small effect size, around 
0.06 is medium, and 0.14 is large size (Barnette 2006; 
Kittler, Menard, & Phillips, 2007). The ethnicity and 
poverty main effects were within medium and large 
effect sizes, 0.116 and 0.056, respectively. Gender and 
interaction effects were small (0.001 to 0.006). 


Table 2 

Three-way ANOVA Test the Effects of Gender, Ethnicity, and Poverty on Students’ Science Performance 







Partial Eta 

Noncent. 

Observed 

Source 

SS 

df 

MS 

F 

Squared 

Parameter 

Power 

Gender 

2492000 

1 

2492000.00 

17250.39*** 

0.006 

17250.39 

1.00 

Ethnicity 

53800000 

4 

13450000.00 

93122.85*** 

0.116 

372491.41 

1.00 

Poverty 

24390000 

1 

24390000.00 

168880.02*** 

0.056 

168880.02 

1.00 

Gender x ethnicity 

120100 

4 

30025.09 

207.87*** 

<0.001 

831.48 

1.00 

Gender x poverty 

26331 

1 

26330.81 

182.30*** 

<0.001 

182.30 

1.00 

Ethnicity x poverty 

1790000 

4 

447464.22 

3097.90*** 

0.004 

12391.62 

1.00 

Gender x ethnicity 
x poverty 

531798 

4 

132949.43 

920.44*** 

0.001 

3681.77 

1.00 

Error 

410300000 

2840581 

144.44 





Total 

10060000000 

2840601 






Correeted Total 

587800000 

2840600 







NOTE. -R^ = .302 (ADJUSTED R‘ = .302). 

*** p <.001 
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Table 3 

Post Hoc Analysis of Ethnicity 


95% Confidence 
Interval 

Parameter 

B 

SE 

t 

Lower 

Bound 

Upper 

Bound 

Partial Eta 
Squared 

Noncent. 

Parameter 

Observed 

Power 

White 

8.90 

.55 

16.24*** 

7.83 

9.98 

.023 

16.24 

1.00 

African American 

-7.03 

.64 

-11.00*** 

-8.28 

-5.78 

.011 

11.00 

1.00 

Hispanic 

-2.44 

.60 

_4 Q9*** 

-3.60 

-1.27 

.001 

4.09 

.98 

Asian 

3.58 

.70 


2.20 

4.95 

.002 

5.10 

1.00 

Other 

0 









NOTE. ***p < .001 


A post hoc test was conducted to determine whether 
the differences among all possible paired-ethnicity 
groups (White vs. Asian, Asian vs. African American, 
etc.) were statistically significant at the probability 
of 0.001. A homogeneity test of variances among 
ethnicity groups indicated these groups distributed 
differently; therefore, unequal variance methods were 
used for post hoc analysis. The results showed that 
all of the possible paired groups were statistically 
significant (p < 0.001) (see Table 3). 

When investigating the interactions of gender and 
ethnicity, ten (2 x 5) disaggregated subgroups were 
analyzed. The mean scores of male and female ethnic 
subgroups had the same order: White, Asian, other, 
Hispanic, and African American (see Table 4). It was 
noted that in every ethnic subgroup, males scored 
higher than females by approximately three to four 
points. The statistical test on the interaction between 
gender and ethnicity was statistically significant 
(p < 0.001 ) (see Table 2). This result indicated that 
after keeping other effects constant, the interaction 
between gender and ethnicity showed impact on fifth 
graders’ science performance. 

When examining the interactions between gender and 
poverty, four (2 x 2) subgroups were tested. The below 
poverty group scored lower than the at/above poverty 
group within male and female groups (see Table 4). 
Additionally, males outperformed females within each 
of the poverty levels. The largest difference in mean 
scores was observed between males at/above poverty 
{M = 62, SD = 12.51) and females below poverty 
(M = 45, SD = 14.07). The interaction of gender and 
poverty was statistically significant in the ANOVA 
test (p < 0.001) (see Table 2). 


When testing the ethnicity and poverty, ten (5 x 2) 
disaggregated subgroups were compared. The average 
scores of ethnic groups had the same rank order within 
each poverty level; White, Asian, other, Hispanic, and 
African American (see Table 4). It was no surprise that 
at/above poverty students outperformed their ethnic 
counterparts by 8 to 15 points. The largest mean 
difference of 20 points was found between White 
males (M = 65, SD = 11.19) and African American 
females (M = 45, SD = 13.21). The ANOVA test 
indicated that the interaction between ethnicity and 
poverty level was statistically significant (p < 0.001) 
(see Table 2). 

The 3 -way interaction separated students into 20 
disaggregated groups (gender [2] x poverty [2] x 
ethnicity [5]). White males “at/above poverty” 

{M = 66, SD = 10.76) had the highest mean score and 
African American females “below poverty” {M = 40, 
SD = 12.76) had the lowest mean score, leaving a 26 
point gap (see Table 5 and Figure 1). A significant test 
of the 3 -way interaction supported the effect on fifth 
graders’ science performance (see Table 2). 

Discussion 

This study shows males performed better than 
females, which is consistent with the U.S. Department 
of Education’s (2007) report and Maehr and 
Steinkamp’s (1983) NAEP study. Eynch (2000), 
Kalenberg (1995a), and Arambula-Greenfield (1999) 
suspected a strong correlation between ethnicity 
and poverty and indicated that poverty was more 
responsible for student achievement variations. 

This study confirms fhaf fiffh graders’ efhnicify 
and poverfy have sfrong associafions wifh sfudenf 
performance (Grigg ef ah, 2006; Hewson ef ah. 
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Table 4 Table 5 

Descriptive Analyses of Bi-variables of Fifth Grade Descriptive Analyses of Tri-variables of Fifth Grade 

Science IRT Scores Science IRT Scores 


Bi-variables 

M 

SD 

Tri-variables 

M 

SD 

Male 



Male 



White 

65 

11.19 

White 



African American 

49 

14.02 

Below poverty 

58 

13.10 

Hispanic 

53 

14.44 

At/above poverty 

66 

10.76 

Asian 

61 

14.05 

African American 



Other 

55 

16.53 

Below poverty 

43 

14.93 

Female 



At/above poverty 

53 

12.19 

White 

62 

11.42 

Hispanic 



African American 

45 

13.21 

Below poverty 

46 

13.85 

Hispanic 

50 

13.81 

At/above poverty 

56 

13.46 

Asian 

57 

15.44 



Other 

52 

14.38 

Asian 



Male 



Below poverty 

51 

14.74 



At/above poverty 

63 

12.58 

Below poverty 
At/above poverty 

48 

62 

15.39 

12.51 

Other 





Below poverty 

44 

16.94 

Female 



At/above poverty 

62 

11.59 

Below poverty 

45 

14.07 



At/above poverty 

59 

12.82 

Female 



White 



White 



Below poverty 

56 

12.65 

Below poverty 

55 

12.11 

At/above poverty 

64 

11.01 

At/above poverty 

62 

11.06 

African American 



African American 



Below poverty 

41 

13.95 

Below poverty 

40 

12.76 

At/above poverty 

50 

12.54 

At/above poverty 

48 

12.45 

Hispanic 



Hispanic 



Below poverty 

44 

13.46 

Below poverty 

43 

12.75 

At/above poverty 

55 

13.09 

At/above poverty 

54 

12.66 

Asian 



Asian 



Below poverty 

47 

13.91 

Below poverty 

44 

12.47 

At/above poverty 

62 

13.73 

At/above poverty 

60 

14.43 

Other 



Other 



Below poverty 

44 

15.93 

Below poverty 

44 

14.64 

At/above poverty 

58 

12.79 

At/above poverty 

55 

12.90 


2001; Lee & Luykx, 2005, 2007). The impact of 
the interactions of ethnicity and poverty persists 
within the 10 subgroups’ outcomes. It is interesting 
that Asian fifth graders have the largest (15 points) 
difference between at/above poverty and below 
poverty groups. This result does not support the 
stereotypical misconception that all Asian students 
perform well academically. 

It was not surprising to find that children from at/ 
above poverty performed better that those from below 
poverty. However, the difference in the average scores 
between those two groups is one standard deviation. 
Study results provide a clear picture of the differential 
science performance gap between those above and 


below poverty. The research literature indicates that 
high-poverty schools provide inadequate learning 
opportunities (Lee & Luykx, 2007). Unfortunately, 
the disparate performance results from this 
investigation might be the product of economic biases 
such as the quality of the school the students attended. 

Findings indicate that all three variables contribute 
significantly to fifth graders’ science performance. 
Differences exist on each main effect. The smallest to 
the largest mean differences between subgroups are 
gender (3 points), poverty (14 points), and ethnicity 
(16 points). Significant consequences compound with 
the 2-way and 3 -way interactions between and among 
these variables. The gaps in science scores widen. 
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The largest gap is found in the 3 -way interaction 
which reveals a 26 point gap between White males 
“at/above poverty” and African American females 
“below poverty. It should be noted that below 
poverty females are the most vulnerable group. 

Among the 20 subgroups, the below poverty, ethnic 
minority females performed similarly, but below their 
male counterparts. Research findings concur with 
Scantlebury and Baker’s (2007) statement. 

Many people remain at the margins of science, . . . 
research still faces the challenge of considering 
gender, race, and socioeconomic status within 
accountability systems that want simpler answers 
than we can provide, (p. 279) 

Future Studies 

The ever evolving demographics of the U.S. require 
science research to address science for all Americans 
continuously (Gallagher & Anderson, 1999). It 
is important to keep track of students’ science 
achievement through disaggregated data for later 
comparison in longitudinal studies. It is equally 
important to investigate related causal factors. 


which are responsible for discrepancies in science 
performance. Most critical are the school related 
factors that can be changed, improved, or eliminated. 

Future studies should focus on the quality of science 
instruction received by the students. Schools can 
provide the life experiences necessary for improving 
academic performance for disadvantaged students 
(Gustafson, 2002). Disadvantaged students exhibit 
social-emotional factors that negatively influence 
achievement scores, but those factors can be dealt with 
in the schools (Becker & Luthar, 2002). Teachers are 
essential participants in creating effective alternative 
instructional approaches that meet the needs of 
diverse learners. Diverse classroom practices do 
not always correspond with the National Science 
Education Standards (NSES) (Eradd & Eee, 1999). 

The report Before It’s Too Late (National Commission 
on Mathematics and Science Teaching for the 21st 
Century, 2000) recommended increasing the number of 
high-quality science teachers by providing an ongoing 
system of professional development and incentives for 
improvement. Schools must attract and retain high- 
quality teachers by increasing salaries and incentives 
and by improving the school work environment. 
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Preparation of preservice science teachers should 
address gender issues including biases in the teacher, 
the curriculum, classroom practices, and research 
(Scantlehury, 1994). Preservice teachers need to 
confront their personal attitudes, beliefs, assumptions, 
and practices before teaching equitably in culturally 
diverse schools (Bryan & Atwater, 2002). 

A case is made for improving the school success 
of ethnically diverse students through culturally 
responsive teaching and for preparing teachers 
in preservice education programs with the 
knowledge, attitudes, and skills needed to do this. 
(Gay, 2002, p. 106) 
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